Granular Classification for Imbalanced Datasets: A Minkowski Distance-Based Method

نویسندگان

چکیده

The problem of classification for imbalanced datasets is frequently encountered in practical applications. data to be classified this are skewed, i.e., the samples one class (the minority class) much less than those other classes majority class). When dealing with datasets, most classifiers encounter a common limitation, that is, they often obtain better performances on class. To alleviate study, fuzzy rule-based modeling approach using information granules proposed. Information granules, as some entities derived and abstracted from data, can used describe capture characteristics (distribution structure) both classes. Since geometric depend distance measures granulation process, main idea study construct each Minkowski then establish models by “If-Then” rules. experimental results involving synthetic publicly available reflect proposed distance-based method produce series shapes granular satisfying performance datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification in Imbalanced Datasets

In this thesis we study the classification task in the presence of class imbalanced data. This task arises in many applications when we are interested in the under-represented (minority) classes. Examples of such applications are related to fraud detection, medical diagnosis and monitoring, text categorization, risk management, information retrieval and filtering. Although there exist many stan...

متن کامل

Hybrid classification approach for imbalanced datasets

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi CHAPTER

متن کامل

Margin-Based Over-Sampling Method for Learning from Imbalanced Datasets

Learning from imbalanced datasets has drawn more and more attentions from both theoretical and practical aspects. Over-sampling is a popular and simple method for imbalanced learning. In this paper, we show that there is an inherently potential risk associated with the oversampling algorithms in terms of the large margin principle. Then we propose a new synthetic over sampling method, named Mar...

متن کامل

Oversampling Method for Imbalanced Classification

Classification problem for imbalanced datasets is pervasive in a lot of data mining domains. Imbalanced classification has been a hot topic in the academic community. From data level to algorithm level, a lot of solutions have been proposed to tackle the problems resulted from imbalanced datasets. SMOTE is the most popular data-level method and a lot of derivations based on it are developed to ...

متن کامل

A Novel One Sided Feature Selection Method for Imbalanced Text Classification

The imbalance data can be seen in various areas such as text classification, credit card fraud detection, risk management, web page classification, image classification, medical diagnosis/monitoring, and biological data analysis. The classification algorithms have more tendencies to the large class and might even deal with the minority class data as the outlier data. The text data is one of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Algorithms

سال: 2021

ISSN: ['1999-4893']

DOI: https://doi.org/10.3390/a14020054